Dual-Word Embedding Model Considering Syntactic Information for Cross-Domain Sentiment Classification

نویسندگان

چکیده

The purpose of cross-domain sentiment classification (CDSC) is to fully utilize the rich labeled data in source domain help target perform even when are insufficient. Most existing methods focus on obtaining transferable semantic information but ignore syntactic information. performance BERT may decrease because transfer, and traditional word embeddings, such as word2vec, cannot obtain contextualized vectors. Therefore, achieving best results CDSC difficult only or word2vec used. In this paper, we propose a Dual-word Embedding Model Considering Syntactic Information for Cross-domain Sentiment Classification. Specifically, dual-word embeddings using word2vec. After performing embedding, pay closer attention information, mainly self-attention TextCNN. embedding obtained, graph network used extract document, mechanism important aspects. Experiments two real-world datasets show that our model outperforms other strong baselines.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification

We present a method that learns word embedding for Twitter sentiment classification in this paper. Most existing algorithms for learning continuous word representations typically only model the syntactic context of words but ignore the sentiment of text. This is problematic for sentiment analysis as they usually map words with similar syntactic context but opposite sentiment polarity, such as g...

متن کامل

Learning Bilingual Sentiment Word Embeddings for Cross-language Sentiment Classification

The sentiment classification performance relies on high-quality sentiment resources. However, these resources are imbalanced in different languages. Cross-language sentiment classification (CLSC) can leverage the rich resources in one language (source language) for sentiment classification in a resource-scarce language (target language). Bilingual embeddings could eliminate the semantic gap bet...

متن کامل

A POS-based Ensemble Model for Cross-domain Sentiment Classification

In this paper, we focus on the tasks of cross-domain sentiment classification. We find across different domains, features with some types of part-of-speech (POS) tags are domain-dependent, while some others are domain-free. Based on this finding, we proposed a POS-based ensemble model to efficiently integrate features with different types of POS tags to improve the classification performance. W...

متن کامل

Active Learning for Cross-domain Sentiment Classification

In the literature, various approaches have been proposed to address the domain adaptation problem in sentiment classification (also called cross-domain sentiment classification). However, the adaptation performance normally much suffers when the data distributions in the source and target domains differ significantly. In this paper, we suggest to perform active learning for cross-domain sentime...

متن کامل

Cross-Lingual Mixture Model for Sentiment Classification

The amount of labeled sentiment data in English is much larger than that in other languages. Such a disproportion arouse interest in cross-lingual sentiment classification, which aims to conduct sentiment classification in the target language (e.g. Chinese) using labeled data in the source language (e.g. English). Most existing work relies on machine translation engines to directly adapt labele...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2022

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math10244704